Ser. No. 09/748,433 



IN THE CLAIMS: 

1. (Currently amended) A method of training a scoring matrix for use by a classification system, 
the classification system for use in performing classification requests based on natural language 
text and with use of said scoring matrix which has been based on a set of training data comprising 
natural language text, the method comprising the steps of: 

generating an initial scoring matrix comprising a numerical value for each of a set of n 
classes in association with each of a set of m features, the initial scoring matrix based on said set of 
training data and, for each element of said set of training data, based on a subset of said features 
which are comprised in said natural language text of said element of said set of training data and 
on one of said classes which has been identified therefor; and 

based on the initial scoring matrix and said set of training data, generating a discriminatively 
trained scoring matrix for use by said classification system by adjusting one or more of said 
numerical values such that a greater degree of discrimination exists between competing ones of 
said classes when said classification requests are performed, thereby resulting in a reduced 
classification error rate^ 

wherein said step of adjusting said numerical values is performed with use of a Generalized 
Probabilistic Descent algorithm and 

wherein said step of adjusting said numerical values comprises, for each element of said set 
of training data, modifying values associated with the identified class and values associated with 
one or more of the other classes such that a score obtained for said element of said set of training 
data based on said modified values associated with the identified class is improved relative to one 
or more scores obtained for said element of said set of training data based on said modified values 
associated with said one or more other classes . 

2. Canceled. 

3. (Currently amended) The method of claim [2] i wherein said step of adjusting said numerical 
values comprises iteratively adjusting said numerical values until a stopping criterion is met. 



-2- 



Ser. No. 09/748,433 



4. (Original) The method of claim 3 wherein said stopping criterion comprises an empirical loss 
threshold . 



5. (Original) The method of claim 3 wherein said stopping criterion comprises a classification 
error rate threshold. 



6. Canceled. 

7. (Currently amended) The method of claim [6] i wherein said values are modified based on the 
following equation of the form: 
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8. (Original) The method of claim 1 wherein said classification requests comprise call routing 
requests, wherein said classes comprise call routing destinations, and wherein said set of training 
data comprises call routing requests together with associated identified call routing destinations 
therefor. 



9. (Original) The method of claim 1 wherein said classification requests comprise document 
retrieval requests, wherein said classes comprise documents, and wherein said set of training data 
comprises document retrieval requests together with associated identified documents therefor. 

10. (Original) The method of claim 1 wherein said natural language text upon which said 
classification requests are based and said natural language text comprised in said set of training 
data is processed without the use of stop word filtering. 
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11. (Currently amended) A method of performing classification requests based on natural 
language text and with use of a discriminatively trained scoring matrix which has been trained 
based on a set of training data comprising natural language text, the scoring matrix having been 
discriminatively trained by a method comprising the steps of: 

generating an initial scoring matrix comprising a numerical value for each of a set of n 
classes in association with each of a set of m features, the initial scoring matrix based on said set of 
training data and, for each element of said set of training data, based on a subset of said features 
which are comprised in said natural language text of said element of said set of training data and on 
one of said classes which has been identified therefor; and 

based on the initial scoring matrix and said set of training data, generating said 
discriminatively trained scoring matrix by adjusting one or more of said numerical values such that 
a greater degree of discrimination exists between competing ones of said classes when said 
classification requests are performed, thereby resulting in a reduced classification error rate^ 

wherein said step of adjusting said numerical values is performed with use of a Generalized 
Probabilistic Descent algorithm and 

wherein said step of adjusting said numerical values comprises, for each element of said set 
of training data, modifying values associated with the identified class and values associated with 
one or more of the other classes such that a score obtained for said element of said set of training 
data based on said modified values associated with the identified class is improved relative to one 
or more scores obtained for said element of said set of training data based on said modified values 
associated with said one or more other classes . 

12. Canceled. 

13. (Currently amended) The method of claim [12] \\ wherein said step of adjusting said 
numerical values comprises iteratively adjusting said numerical values until a stopping criterion is 
met. 

14. (Original) The method of claim 1 3 wherein said stopping criterion comprises an empirical loss 
threshold . 
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15. (Original) The method of claim 13 wherein said stopping criterion comprises a classification 
error rate threshold. 

16. Canceled. 

17. (Currently amended) The method of claim [16] H wherein said values are modified based on 
the following equation of the form: 

ifv= k 



if V It k 

18. (Original) The method of claim 1 1 wherein said classification requests comprise call routing 
requests, wherein said classes comprise call routing destinations, and wherein said set of training 
data comprises call routing requests together with associated identified call routing destinations 
therefor. 

19. (Original) The method of claim 1 1 wherein said classification requests comprise document 
retrieval requests, wherein said classes comprise documents, and wherein said set of training data 
comprises document retrieval requests together with associated identified documents therefor. 

20. (Original) The method of claim 1 1 wherein said natural language text upon which said 
classification requests are based and said natural language text comprised in said set of training 
data is processed without the use of stop word filtering. 

21. (Currently amended) An apparatus for training a scoring matrix for use by a classification 
system, the classification system for use in performing classification requests based on natural 
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language text and with use of said scoring matrix which has been based on a set of training data 
comprising natural language text, the apparatus comprising: 

means for generating an initial scoring matrix comprising a numerical value for each of a 
set of n classes in association with each of a set of m features, the initial scoring matrix based on 
said set of training data and, for each element of said set of training data, based on a subset of said 
features which are comprised in said natural language text of said element of said set of training 
data and on one of said classes which has been identified therefor; and 

based on the initial scoring matrix and said set of training data, means for generating a 
discriminatively trained scoring matrix for use by said classification system by adjusting one or 
more of said numerical values such that a greater degree of discrimination exists between 
competing ones of said classes when said classification requests are performed, thereby resulting in 
a reduced classification error rate^ 

wherein said means for adjusting said numerical values executes a Generalized 
Probabilistic Descent algorithm and 

wherein said means for adjusting said numerical values comprises, for each element of said 
set of training data, means for modifying values associated with the identified class and values 
associated with one or more of the other classes such that a score obtained for said element of said 
set of training data based on said modified values associated with the identified class is improved 
relative to one or more scores obtained for said element of said set of training data based on said 
modified values associated with said one or more other classes . 

22. Canceled. 

23. (Currently amended) The apparatus of claim [22] 2\ wherein said means for adjusting said 
numerical values comprises means for iteratively adjusting said numerical values until a stopping 
criterion is met. 

24. (Original) The apparatus of claim 23 wherein said stopping criterion comprises an empirical 
loss threshold . 



Ser. No. 09/748,433 



25. (Original) The apparatus of claim 23 wherein said stopping criterion comprises a classification 
error rate threshold. 



26. Canceled. 

27. (Currently amended) The apparatus of claim [26] 21 wherein said values are modified based 
on the following equation of the form: 



ifv = k 



if V ^ k 



28, (Original) The apparatus of claim 21 wherein said classification requests comprise call routing 
requests, wherein said classes comprise call routing destinations, and wherein said set of training 
data comprises call routing requests together with associated identified call routing destinations 
therefor. 



29. (Original) The apparatus of claim 2 1 wherein said classification requests comprise document 
retrieval requests, wherein said classes comprise documents, and wherein said set of training data 
comprises document retrieval requests together with associated identified documents therefor. 

30. (Original) The apparatus of claim 2 1 wherein said natural language text upon which said 
classification requests are based and said natural language text comprised in said set of training 
data is processed without the use of stop word filtering. 

31. (Currently amended) An apparatus for performing classification requests based on natural 
language text and with use of a discriminatively trained scoring matrix which has been trained 
based on a set of training data comprising natural language text, the scoring matrix having been 
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discriminatively trained by an apparatus comprising: 

means for generating an initial scoring matrix comprising a numerical value for each of a 
set of n classes in association with each of a set of m features, the initial scoring matrix based on 
said set of training data and, for each element of said set of training data, based on a subset of said 
features which are comprised in said natural language text of said element of said set of training 
data and on one of said classes which has been identified therefor; and 

based on the initial scoring matrix and said set of training data, means for generating said 
discriminatively trained scoring matrix by adjusting one or more of said numerical values such that 
a greater degree of discrimination exists between competing ones of said classes when said 
classification requests are performed, thereby resulting in a reduced classification error rate^ 

wherein said means for adjusting said numerical values executes a Generalized 
Probabilistic Descent algorithm and 

wherein said means for adjusting said numerical values comprises, for each element of said 
set of training data, means for modifying values associated with the identified class and values 
associated with one or more of the other classes such that a score obtained for said element of said 
set of training data based on said modified values associated with the identified class is improved 
relative to one or more scores obtained for said element of said set of training data based on said 
modified values associated with said one or more other classes . 

32. Canceled. 

33. (Currently amended) The apparatus of claim [32] 3i wherein said means for adjusting said 
numerical values comprises means for iteratively adjusting said numerical values until a stopping 
criterion is met. 

34. (Original) The apparatus of claim 33 wherein said stopping criterion comprises an empirical 
loss threshold . 

35. (Original) The apparatus of claim 33 wherein said stopping criterion comprises a classification 
error rate threshold. 
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36. Canceled. 



37. (Currently amended) The apparatus of claim [16] 3J_ wherein said values are modified based 
on the following equation of the form: 
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38. (Original) The apparatus of claim 3 1 wherein said classification requests comprise call routing 
requests, wherein said classes comprise call routing destinations, and wherein said set of training 
data comprises call routing requests together with associated identified call routing destinations 
therefor. 

39. (Original) The apparatus of claim 3 1 wherein said classification requests comprise document 
retrieval requests, wherein said classes comprise documents, and wherein said set of training data 
comprises document retrieval requests together with associated identified documents therefor. 

40. (Original) The apparatus of claim 3 1 wherein said natural language text upon which said 
classification requests are based and said natural language text comprised in said set of training 
data is processed without the use of stop word filtering. 



